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SEQUENCE PROTOCOL 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Werner Lubitz 

(B) ROAD: Schoenborngasse 

(C) CITY: Wien 

<E) COUNTRY: Austria 
(F) ZIP CODE: 1080 

(A) NAME: Uwe Sleytr 

(B) ROAD: Parhamerplatz /lO 

(C) CITY: Wien 

(E) COUNTRY: Austria 

(F) ZIP CODE: 1170 



f2/7 



(ii) TITLE OF INVENTION 



Recombinant expression of 
S-layer proteins 




(iii) NUMBER OF SEQUENCES: 

(iv) COMPUTER READABLE FOl 

(A) DATA CARRIER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PateAtIn Release #1.0, version 
#1.3A) (EPA) 

INFORMATION FOR SEQ ID /lO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3687/ base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: lfi.near 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: feacillus stearothermophilus 

(B) STRAIN: PW2 

(vii) IMMEDIATE ORIG±N: 

(B) CLONE (S) :/ sbsA 

( ix) CHARACTERISTI] 

(A) NAME/KEY/: CDS 

(B) POSITION: 1. . 3684 

(ix) CHARACTERIS-yiC: 

(A) NAME/K^Y: sig_peptide 

(B) POSITION: 1. . 90 



V 
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( ix) CHARACTERISTIC : 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 91. .3 684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



10 



ATG GAT AGG AAA AAA GCT GTG AAA CTA GCA ACA OCA AGT GCT ATT GCA 4S 

Met Asp Arg Lys Lys Ala Val Lys Leu Ala Thr Ala Ser Ala lie Ala 
15 -30 -25 -20 -15 

GCA AGT GCA TTT GTC GCT GCA AAT CCA AAC GCT TCT GAA GCG GCT ACA Qfi 

Ala Ser Ala Phe Val Ala Ala Asn Pro Asn Ala Ser Glu Ala Ala Thr 
-10 -5 -L 

20 

GAT GTA GCA ACA GTA GTA AGO CAA GCA AAA GCA CAG TTC AAA AAA GCA 14 4 

Asp Val Ala Thr Val Val Ser Gin Ala Lys Ala Gin Phe Lys Lys Ala 
5 10 15 

25 TAC TAT ACT TAG AGC CAT ACA GTA ACQ GAA ACT GGT GAA TTC CCA AAC 192 

Tyr Tyr Thr Tyr Ser His Thr Val Thr Glu Thr Gly Glu Phe Pro Asn 
20 25 30 



30 



ATT AAC GAT GTA TAT GCT GAA TAC AAC AAA GCG AAA AAA CGA TAC CGT 24 0 

lie Asn Asp Val Tyr Ala Glu Tyr Asn Lys Ala Lys Lys Arg Tyr Arg 
Q 35 40 45 50 

YU GAT GCG GTA GCA TTA GTG AAT AAA GCA GGT GGC GCG AAA AAA GAC GCT 288 

:is Asp Ala Val Ala Leu Val Asn Lys Ala Gly Gly Ala Lys Lys Asp Ala 

m ^5 GO 65 

TAC TTA GCT GAT TTA CAA AAA GAA TAT GAA ACT TAC GTT TTC AAA GCA 336 
Tyr Leu Ala Asp Leu Gin Lys Glu Tyr Glu Thr Tyr Val Phe Lys Ala 
70 75 SO 

40 

AAC CCT AAA TCT GGC GAA GCT CGT GTA GCA ACT TAC ATC GAT GCT TAC 3 84 

Asn Pro Lys Ser Gly Glu Ala Arg Val Ala Thr Tyr He Asp Ala Tyr 
85 90 95 

4 5 AAC TAT GCA ACA AAA. TTA GAC GAA ATG CGC CAA GAG CTA GAG GCT GCT 4 32 
Asn Tyr Ala Thr Lys Leu Asp Glu Met Arg Gin Glu Leu Glu Ala Ala 
100 105 110 

2^ ^ ^^'^ "^^^ ^ CAA TAC TAT CAC AAA ATT 4 80 

50 Val Gin Ala Lys Asp Leu Glu Lys Ala Glu Gin Tyr Tyr His Lys He 

120 125 . 130 

CCT TAT GAA ATT AAA ACT CGC ACA GTC ATT TTA GAT CGC GTA TAT GGT 52 8 

Pro Tyr Glu He Lys Thr Arg Thr Val He Leu Asp Arg Val Tyr Gly 
135 140 145 

AAA ACA ACT CGT GAT TTA CTT CGC TCT ACA TTT AAA GCA AAA GCA CAA 57 6 

Lys Thr Thr Arg Asp Leu Leu Arg Ser Thr Phe Lys Ala Lys Ala Gin 
ISO 155 160 

60 

GAA CTT CGC GAC AGC TTA ATT TAT GAT ATT ACC GTT GCA ATG AAA GCG 624 

Glu Leu Arg Asp Ser Leu He Tyr Asp ^le Thr Val Ala Met Lys Ala 
165 170 175 

65 N 
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CGC GAA GTA CAA GAC GCT GTG AAA GCA GGC AAT TTA GAG AAA GCT AAA 6 72 

Arg Glu Val Gin Asp Ala Val Lys Ala Gly Asn Leu Asp Lvs Ala Lvs 
180 185 190 

5 GCT GCT GTT GAT CAA ATC AAT CAA TAC TTA CCA AAA GTA ACA GAT GCT 72 0 

Ala Ala Val Asp Gin He Asn Gin Tyr Leu Pro Lys Val Thr Asd Ala 
195 200 205 210 

TTC AAA ACT GAA CTA ACA GAA GTA GCG AAA AAA GCA TTA GAT GCA GAT 76 8 

10 Phe Lys Thr Glu Leu Thr Glu Val Ala Lys Lys Ala Leu Asp Ala Asp 

215 220 225 

GAA GCT GCG CTT ACT CCA AAA GTT GAA AGT GTA ACT GCG ATT AAC ACT 81G 
Glu Ala Ala Leu Thr Pro Lys Val Glu Ser Val Ser Ala lie Asn Thr 
15 230 235 240 

CAA AAC AAA GCT GTT GAA TTA ACA GCA GTA CCA GTG AAC GGA ACA CTA 864 
Gin Asn Lys Ala Val Glu Leu Thr Ala Val Pro Val Asn Gly Thr Leu 
245 250 255 
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AAA TTA CAA CTT TCA GCT GCT GCA AAT GAA GAT ACA GTA AAC GTA AAT 912 
Lys Leu Gin Leu Ser Ala Ala Ala Asn Glu Asp Thr Val Asn Val Asn 
260 265 270 

2 5 ACT GTA CGT. ATC TAT AAA GTG GAC GGT AAC ATT CCA TTT GCC CTT AAT 96 0 

Thr Val Arg He Tyr Lys Val Asp Gly- Asn He Pro Phe Ala Leu Asn 
275 280 285 290 

ACG GCA GAT GTT TCT TTA TCT ACA GAC GGA AAA ACT ATC ACT GTG GAT 1008 
Thr Ala Asp Val Ser Leu Ser Thr Asp Gly Lys Thr He Thr Val Asp 
295 300 305 

GCT TCA ACT CCA TTC GAA AAT AAT ACG GAG TAT AAA GTA GTA GTT AAA 105 6 

Ala Ser Thr Pro Phe Glu Asn Asn Thr Glu Tyr Lys Val Val Val Lvs 
m 310 315 320 

GGT ATT AAA GAC AAA AAT GGC AAA GAA TTT AAA GAA GAT GCA TTC ACT 1104 
^■^ Gly He Lys Asp Lys Asn Gly Lys Glu Phe Lys Glu Asp Ala Phe Thr 

325 330 335 

40 

TTC AAG CTT CGA AAT GAT GCT GTA GTT ACT CAA GTG TTT GGA ACT AAT 1152 
Phe Lys Leu Arg Asn Asp Ala Val Val Thr Gin Val Phe Gly Thr Asn 
340 345 350 

4 5 GTA ACA AAC AAC ACT TCT GTA AAC TTA GCA GCA GGT ACT TTC GAC ACT 12 0 0 

Val Thr Asn Asn Thr Ser Val Asn Leu Ala Ala Gly Thr Phe Asp Thr 
355 360 365 370 

GAC GAT ACT TTA ACA GTA GTA TTT GAT AAG TTG TTA GCA CCT GAA ACT 12 4 8 

5 0 Asp Asp Thr Leu Thr Val Val Phe Asp Lys Leu Leu Ala Pro Glu Thr 

375 380 385 

GTA AAC AGC TCG AAC GTT ACT ATT ACA GAT GTT GAA ACT GGA AAA CGC 12 9 5 

Val Asn Ser Ser Asn Val Thr He Thr Asp Val Glu Thr Gly Lvs Ara 
" 390 395 400 

ATT CCA GTA ATT GCA TCT ACT TCT GGT TCT ACA ATT ACT ATT ACG TTA 13 44 

He Pro Val He Ala Ser Thr Ser Gly Ser Thr He Thr He Thr Leu 
405 410 415 

AAA GAA GCG TTA GTA ACT GGT AAA CAA TAT AAA CTT GCT ATC AAT AAT 13 92 

Lys Glu Ala Leu Val Thr Gly Lys Gin Tyr Lys Leu Ala He Asn Asn 
420 425 430 

6 5 GTT AAA ACA TTA ACT GGT TAC AAT GCA GAA GCT TAC GAG TTA GTG TTC 14 4 0 
Val Lys Thr Leu Thr Gly Tyr Asn Ala Glu Ala Tyr Glu Leu Val Phe 
435 440 . 445 450 



60 
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ACT GCA AAC GCA TCA GCA CCA ACT GTT OCT ACC OCT CCT ACT ACT TTA 14 8 Q 

Thr Ala Asn Ala Ser Ala Pro Thr Val Ala Thr Ala Pro Thr Thr Leu 
455 460 465 

5 GGT GGT ACA ACT TTA TCT ACT GGT TCT CTT ACA ACA AAT GTT TGG GGT 153 6 

Gly Gly Thr Thr Leu Ser Thr Gly Ser Leu Thr Thr Asn Val Trp Gly 
470 475 480 

AAA TTG GCT GGT GGT GTG AAT GAA GCT GGA ACT TAT TAT CCT GGT CTT 15 84 

10 Lys Leu Ala Gly Gly Val Asn Glu Ala Gly Thr Tyr Tyr Pro Gly Leu 
485 490 495 

CAA TTC ACA ACA ACG TTT GCT ACT AAG TTA GAC GAA TCT ACT TTA GCT 163 2 

Gin Phe Thr Thr Thr Phe Ala Thr Lys Leu Asp Glu Ser Thr Leu Ala 

IS 500 505 510 

GAT AAC TTT GTA TTA GTT GAA AAA GAA TCT GGT ACA GTT GTT GCT TCT 16 8 0 

Asp Asn Phe Val Leu Val Glu Lys Glu Ser Gly Thr Val Val Ala Ser 
515 520 525 530 
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GAA CTA AAA TAT AAT GCA GAC GOT AAA ATG GTA ACT TTA GTG CCA AAA 17 2 8 

Glu Leu Lys Tyr Asn Ala Asp Ala Lys Met Val Thr Leu Val Pro Lys 
535 540 545 

25 GCG GAC CTT. AAA GAA AAT ACA ATC TAT CAA ATC AAA ATT AAA AAA GGC 1776 
Ala Asp Leu Lys Glu Asn Thr He Tyr. Gin He Lys He Lys Lvs Glv 
550 555* 560 

TTG AAG TCC GAT AAA GGT ATT GAA TTA GGC ACT GTT AAC GAG AAA ACA 1824 
30 Leu Lys Ser Asp Lys Gly He Glu Leu Gly Thr Val Asn Glu Lys Thr 
565 570 575 

TAT GAG TTC AAA ACT CAA GAC TTA ACT GCT CCT ACA GTT ATT AGC GTA 1872 
Tyr Glu Phe Lys Thr Gin Asp Leu Thr Ala Pro Thr Val He Ser Val 
35 580 585 590 

ACG TCT AAA AAT GGC GAC GCT GGA TTA AAA GTA ACT GAA GCT CAA GAA 192 0 
Thr Ser Lys Asn Gly Asp Ala Gly Leu Lys Val Thr Glu Ala Gin Glu 

595 SOO 605 610 

40 

TTT ACT GTG AAG TTC TCA GAG AAT TTA AAT ACA TTT AAT GCT ACA ACC 196 8 

Phe Thr Val Lys Phe Ser Glu Asn Leu Asn Thr Phe Asn Ala Thr Thr 

615 . . 620 625 

45 GTT TCG GGT AGC ACA ATC ACA TAG GGT CAA GTT GCT GTA GTA AAA GCG 2 016 

Val Ser Gly Ser Thr He Thr Tyr. Gly Gin Val Ala Val Val Lys Ala 
630 635 . 640 

GGT GCA AAC TTA TCT GCT CTT ACA GCA AGT GAC ATC ATT CCA GCT AGT 2 0 64 

50 Gly Ala Asn Leu Ser Ala Leu Thr Ala Ser Asp He He Pro Ala Ser 
645 650 S55 

GTT GAA GCG GTT ACT GGT CAA GAT GGA ACA TAC AAA GTG AAA GTT GCT 2112 
Val Glu Ala Val Thr Gly Gin Asp Gly Thr Tyr Lys Val Lys Val Ala 
55 660 665 670 

GCT AAC CAA TTA GAA CGT AAC CAA GGG TAC AAA TTA GTA GTG TTC GGT 2160 

Ala Asn Gin Leu Glu Arg Asn Gin Gly Tyr Lys Leu Val Val Phe Gly 

675 680 685 690 

60 

AAA GGT GCA ACA GCT CCT GTT AAA GAT GCT GCA AAT GCA AAT ACT TTA 22 0 8 

Lys Gly Ala Thr Ala Pro Val Lys Asp Ala Ala Asn Ala Asn Thr Leu 
695 700 705 

" S?*^ ^^"^ ^^"^ "^^"^ ^"^^ ACT GAA GGT CAA GAC GTA ACA 2256 

Ala Thr Asn Tyr He Tyr Thr Phe Thr Thr Glu Gly Gin Asp Val Thr 
710 715 . 720 
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GCA CCA ACQ GTT ACA AAA GTA TTC AAA GGT GAT TCT TTA AAA GAC OCT 
Ala Pro Thr Val Thr Lys Val Phe Lys Gly Asp Ser Leu Lys Asp Ala 
725 730 735 



2304 



5 GAT GCA GTT ACT ACA CTT ACG AAC GTT GAT GCA GGT CAA AAA TTC ACT 2 3 5'' 

Asp Ala Val Thr Thr Leu Thr Asn Val Asp Ala Gly Gin Lys Phe Thr 
740 745 750 

ATC CAA TTT AGC GAA GAA TTA AAA ACT TCT AGT GGT TCT TTA GTG GGT 24 00 

ID He Gin Phe Ser Glu Glu Leu Lys Thr Ser Ser Gly Ser Leu Val Glv 
755 760 765 770 

GGC AAA GTA ACT GTC GAG AAA TTA ACA AAC AAC GGA TGG GTA GAT GCT 2 44 B 

Gly Lys Val Thr Val Glu Lys Leu Thr Asn Asn Gly Trp Val Asp Ala 
IS 775 780 785 

GGT ACT GGA ACA ACT GTA TCA GTT GCT CCT AAG ACA GAT GCA AAT GGT 24 96 

Gly Thr Gly Thr Thr Val Ser Val Ala Pro Lys Thr Asp Ala Asn Gly 
790 795 800 
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AAA GTA ACA GCT GCT GTG GTT ACA TTA ACT GGT CTT GAC AAT AAC GAC 2 54 4 

Lys Val Thr Ala Ala Val Val Thr Leu Thr Gly Leu Asp Asn Asn Asp 
805 810 815 

2 5 AAA GAT GCG AAA TTG CGT CTG GTA GTA GAT AAG TCT TCT ACT GAT GGA 2 5 92 

Lys Asp Ala Lys Leu Arg Leu Val Val Asp Lys Ser Ser Thr Asp Glv 
820 825 ■ 830 

ATT GCT GAT GTA GCT GGT AAT GTA ATT AAG GAA AAA GAT ATT TTA ATT 2 64 0 

30 He Ala Asp Val Ala Gly Asn Val He Lys Glu Lys Asp He Leu He 
B35 840 B45 850 

CGT TAC AAC AGC TGG AGA CAC ACT GTA GCT TCT GTG AAA GCT GCT GCT 2688 
Arg Tyr Asn Ser Trp Arg His Thr Val Ala Ser Val Lys Ala Ala Ala 
35 B55 860 865 

GAC AAA GAT GGT CAA AAC GCT TCT GCT GCA TTC CCA ACA AGC ACT GCA 2736 
Asp Lys Asp Gly Gin Asn Ala Ser Ala Ala Phe Pro Thr Ser Thr Ala 
870 87S 880 

40 

ATT GAT ACA ACT AAG AGC TTA TTA GTT GAA TTC AAT GAA ACT GAT TTA 2 7 84 

He Asp Thr Thr Lys Ser Leu Leu Val Glu Phe Asn Glu Thr Asp Leu 
885 890 B95 

4 5 GCG GAA GTT AAA CCT GAG AAC ATC GTT GTT AAA GAT GCA GCA GGT AAT 28 32 

Ala Glu Val Lys Pro Glu Asn He Val Val Lys Asp Ala Ala Gly Asn 
900 905 910 

GCG GTA GCT GGT ACT GTA ACA GCA TTA GAC GGT TCT ACA AAT AAA TTT 2 8 80 

50 Ala Val Ala Gly Thr Val Thr Ala Leu Asp Gly Ser Thr Asn Lys Phe 
915 920 925 930 

GTA TTC ACT CCA TCT CAA GAA TTA AAA GCT GGT ACA GTT TAC TCT GTA 2928 
Val Phe Thr Pro Ser Gin Glu Leu Lys Ala Gly Thr Val Tyr Ser Val 
^5 935 940 945 

ACA ATT GAC GGT GTG AGA GAT AAA GTA GGT AAC ACA ATC TCT AAA TAC 2 976 

Thr He Asp Gly Val Arg Asp Lys Val Gly Asn Thr He Ser Lvs Tvr 
950 955 9S0 

€0 

ATT ACT TCG TTC AAG ACT GTA TCT GCG AAT CCA ACG TTA TCT TCA ATC 3 0 24 

He Thr Ser Phe Lys Thr Val Ser Ala Asn Pro Thr Leu Ser Ser He 
9S5 970 975 

6S AGC ATT GCT GAC GGT GCA GTT AAC GTT GAC CGT TCT AAA ACA ATT ACA 3 0 72 

Ser He Ala Asp Gly Ala Val Asn Val Asp Arg Ser Lys Thr He Thr 
980 985 990 
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ATT GAA TTC AGC GAT TCA GTT CCA AAC CCA ACA ATC ACT CTT AAG AAG 
He Glu Phe Ser Asp Ser Val Pro Asn Pro Thr He Thr Leu Lvs Lvs 
955 1000 1005 iblO 

5 GOT GAC GGA ACT TCA TTT ACT AAT TAC ACT TTA GTA AAT GTA AAT AAT 
Ala Asp Gly Thr Ser Phe Thr Asn Tyr Thr Leu Val Asn Val Asn Asn 
103.5 1020 1025 

GAA AAT AAA ACA TAC AAA ATT GTA TTC CAC AAA GGT GTA ACA CTT GAC 
10 Glu Asn Lys Thr Tyr Lys He Val Phe His Lys Gly Val Thr Leu Asp 
1030 1035 1040 

GAG TTT ACT CAA TAT GAG TTA GCA GTT TCA AAA GAT TTT CAA ACT GGT 
Glu Phe Thr Gin Tyr Glu Leu Ala Val Ser Lys Asp Phe Gin Thr Gly 
1045 1050 1055 



15 



20 



ACT GAT ATT GAT AGC AAA GTT ACA TTC ATC ACA GGT TCT GTT GCT ACT 
Thr Asp He Asp Ser Lys Val Thr Phe He Thr Gly Ser Val Ala Thr 
1060 1065 1070 

GAC GAA GTA AAA CCT GCT CTA GTA GGC GTT GGT TCA TGG AAT GGA ACA 
Asp Glu Val Lys Pro Ala Leu Val Gly Val Gly Ser Trp Asn Gly Thr 
10'^5 1080 1085 1D90 

25 AGC TAT ACT CAG GAT GCT GCA GCA ACA CCA CTT CGG TCT GTA GCT GAC 
Ser Tyr Thr Gin Asp Ala Ala Ala Thr Arg Leu Arg Ser Val Ala Asp 
1095 1100 1105 

TTC GTT GCG GAG CCA GTT GCC CTT CAA TTC TCA GAA GGT ATC GAT TTA 
30 Phe Val Ala Glu Pro Val Ala Leu Gin Phe Ser Glu Gly He Asp Leu 
1110 1115 1120 
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ACG AAT GCA ACT GTG ACA GTA ACA AAT ATT ACT GAT GAT AAA ACT GTT 
Thr Asn Ala Thr Val Thr Val Thr Asn He Thr Asp Asp Lys Thr Val 
1125 1130 1135 

GAA GTT ATT TCA AAA GAG AGT GTA GAC GCA GAC CAT GAT GCA GGT GCT 
Glu Val He Ser Lys Glu Ser Val Asp Ala Asp His Asp Ala Gly Ala 
1140 1145 2.1S0 

ACT AAG GAG ACA TTA GTA ATT AAC ACA GTT ACT CCT TTA GTA CTT GAT 
Thr Lys Glu Thr Leu Val He Asn Thr Val Thr Pro Leu Val Leu Asp 
1155 iieo 1165 1170 

4 5 AAC AGC AAG ACT TAT AAG ATT GTT GTA AGT GGA GTT AAA GAT GCA GCA 
Asn Ser Lys Thr Tyr Lys He Val Val Ser Gly Val Lys Asp Ala Ala 
1175 1180 1185 

GGT AAT GTT GCA GAT ACT ATT ACA TTC TAT ATT AAG TAA 
50 Gly Asn Val Ala Asp Thr He Thr Phe Tyr He Lys 
1190 1195 



3120 



3168 



3216 



3264 



3312 



3360 



3408 



3456 



3504 



3552 



3600 



3648 



3687 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1228 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



so 
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Met Asp Arg Lys Lys Ala Val Lys Leu Ala Thr Ala Ser Ala He Ala 
-30 -25 -20 -15 

5 Ala Ser Ala Phe Val Ala Ala Asn Pro Asn Ala Ser Glu Ala Ala Thr 

-10 -5 1 

Asp Val Ala Thr Val Val Ser Gin Ala Lys Ala Gin Phe Lys Lys Ala 
5 10 15 

10 

Tyr Tyr Thr Tyr Ser His Thr Val Thr Glu Thr Gly Glu Phe Pro Asn 
20 25 30 

He Asn Asp Val Tyr Ala Glu Tyr Asn Lys Ala Lys Lys Arg Tyr Arg 
15 35 40 45 50 

Asp Ala Val Ala Leu Val Asn Lys Ala Gly Gly Ala Lys Lys Asp Ala 

55 60 65 

20 Tyr Leu Ala Asp Leu Gin Lys. Glu Tyr Glu Thr Tyr Val Phe Lys Ala 
70 75 80 

Asn Pro Lys Ser Gly Glu Ala Arg Val Ala Thr Tyr He Asp Ala Tyr 
85 90 95 

25 

Asn Tyr Ala Thr Lys Leu Asp Glu Met Arg Gin Glu Leu Glu Ala Ala 
100 105 110 

Val Gin Ala Lys Asp Leu Glu Lys Ala Glu Gin Tyr Tyr His Lvs He 
30 lis 120 125 130 

Pro Tyr Glu He Lys Thr Arg Thr Val He Leu Asp Arg Val Tyr Glv 
135 140 145 

35 Lys Thr Thr Arg Asp Leu Leu Arg Ser Thr Phe Lys Ala Lys Ala Gin 
150 155 160 

Glu Leu Arg Asp Ser Leu He Tyr Asp He Thr Val Ala Met Lys Ala 
X6S 170 175 

40 

Arg Glu Val Gin Asp Ala Val Lys Ala Gly Asn Leu Asp Lys Ala Lys 
ISO 185 190 

Ala Ala Val Asp Gin He Asn Gin Tyr Leu Pro Lys Val Thr Asp Ala 
"5 195 200 205 210 

Phe Lys Thr Glu Leu Thr Glu Val Ala Lys Lys Ala Leu Asp Ala Asp 
215 220 225 

so Glu Ala Ala Leu Thr Pro Lys Val Glu Ser Val Ser Ala He Asn Thr 
230 235 240 

Gin Asn Lys Ala Val Glu Leu Thr Ala Val Pro Val Asn Gly Thr Leu 
245 250 255 

Lys Leu Gin Leu Ser Ala Ala Ala Asn Glu Asp Thr Val Asn Val Asn 
260 265 270 

^-^^ '^^ Asn He Pro Phe Ala Leu Asn 

" 275 2B0 285 290 

Thr Ala Asp Val Ser Leu Ser Thr Asp Gly Lys Thr He Thr Val Asp 
295 300 305 

65 Ala Ser Thr Pro Phe Glu Asn Asn Thr Glu Tyr Lys Val Val Val Lvs 
310 315 320 
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Gly He Lys Asp Lys Asn Gly Lys Glu Phe Lys Glu Asp Ala Phe Thr 
325 330 335 

Phe Lys Leu Arg Asn Asp Ala Val Val Thr Gin Val Phe Gly Thr Asn 
5 340 345 350 

Val Thr Asn Asn Thr Ser Val Asn Leu Ala Ala Gly Thr Phe Asp Thr 
355 360 365 370 

10 Asp Asp Thr Leu Thr Val Val Phe Asp Lys Leu Leu Ala Pro Glu Thr 

375 380 385 

Val Asn Ser Ser Asn Val Thr He Thr Asp Val Glu Thr Gly Lys Arg 
390 395 400 

15 

He Pro Val He Ala Ser Thr Ser Gly Ser Thr He Thr He Thr Leu 
405 410 415 

Lys Glu Ala Leu Val Thr Gly Lys Gin Tyr Lys Leu Ala He Asn Asn 
20 420 425 . 430 

m Val Lys Thr Leu Thr Gly Tyr Asn Ala Glu Ala Tyr Glu Leu Val Phe 

1=^ 435 440 445 450 

Lj 25 Thr Ala Asn Ala Ser Ala Pro Thr Val Ala Thr Ala Pro Thr Thr Leu 
^ -455 ■ . 4S0 465 

=P Gly Gly Thr Thr Leu Ser Thr Gly Ser Leu Thr Thr Asn Val Trp Gly 

M 470 475 480 

3 0 

Lys Leu Ala Gly Gly Val Asn Glu Ala Gly Thr Tyr Tyr Pro Gly Leu 
L, 485 490 495 

O Gin Phe Thr Thr Thr Phe Ala Thr Lys Leu Asp Glu Ser Thr Leu Ala 

llj 35 500 505 510 

Asp Asn Phe Val Leu Val Glu Lys Glu Ser Gly Thr Val Val Ala Ser 
'■^ 515 520 525 530 

40 Glu Leu Lys Tyr Asn Ala Asp Ala Lys Met Val Thr Leu Val Pro Lys 

535 540 545 

Ala Asp Leu Lys Glu Asn Thr He Tyr Gin He Lys He Lys Lys Gly 
550 555 560 

Leu Lys Ser Asp Lys Gly He Glu Leu Gly Thr Val Asn Glu Lys Thr 
565 570 575 

Tyr Glu Phe Lys Thr Gin Asp Leu Thr Ala Pro Thr Val He Ser Val 
50 580 535 590 

Thr Ser Lys Asn Gly Asp Ala Gly Leu Lys Val Thr Glu Ala Gin Glu 
595 600 605 610 

55 Phe Thr Val Lys Phe Ser Glu Asn Leu Asn Thr Phe Asn Ala Thr Thr 

615 520 625 

Val Ser Gly Ser Thr He Thr Tyr Gly Gin Val Ala Val Val Lys Ala 
630 635 640 

CD 

Gly Ala Asn Leu Ser Ala Leu Thr Ala Ser Asp He He Pro Ala Ser 
645 650 655 

Val Glu Ala Val Thr Gly Gin Asp Gly Thr Tyr Lys Val Lys Val Ala 
fis 660 665 • 670 
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Ala Asn Gin Leu Glu Arg Asn Gin Gly Tyr Lys Leu Val Val Phe Gly 
675 680 68S 690 

Lys Gly Ala Thr Ala Pro Val Lys Asp Ala Ala Asn Ala Asn Thr Leu 
s 695 700 70S 

Ala Thr Asn Tyr lie Tyr Thr Phe Thr Thr Glu Gly Gin Asp Val Thr 
■710 715 720 

10 Ala Pro Thr Val Thr Lys Val Phe Lys Gly Asp Ser Leu Lys Asp Ala 
725 730 735 

Asp Ala Val Thr Thr Leu Thr Asn Val Asp Ala Gly Gin Lys Phe Thr 
740 745 750 

15 

lie Gin Phe Ser Glu Glu Leu Lys Thr Ser Ser Gly Ser Leu Val Gly 
755 750 765 770 

Gly Lys Val Thr Val Glu Lys Leu Thr Asn Asn Gly Trp Val Asp Ala 
20 775 780 785 

Gly Thr Gly Thr Thr Val Ser Val Ala Pro Lys Thr Asp Ala Asn Gly 
790 795 800 

25 Lys Val Thr Ala Ala Val Val Thr Leu Thr Gly Leu Asp Asn Asn Asp 
805 810 815 

Lys Asp Ala Lys Leu Arg Leu Val Val Asp Lys Ser Ser Thr Asp Gly 
820 S25 830 

30 

He Ala Asp Val Ala Gly Asn Val He Lys Glu Lys Asp He Leu He 
835 840 845 850 

Arg Tyr Asn Ser Trp Arg His Thr Val Ala Ser Val Lys Ala Ala Ala 
35 855 860 865 

Asp Lys Asp Gly Gin Asn Ala Ser Ala Ala Phe Pro Thr Ser Thr Ala 
870 875 880 

40 He Asp Thr Thr Lys Ser Leu Leu Val Glu Phe Asn Glu Thr Asp Leu 
885 890 895 

Ala Glu Val Lys Pro Glu Asn He Val Val Lys Asp Ala Ala Gly Asn 
900 90S 910 

45 

Ala Val Ala Gly Thr Val Thr Ala Leu Asp Gly Ser Thr Asn Lys Phe 
915 920 925 930 

Val Phe Thr Pro Ser Gin Glu Leu Lys Ala Gly Thr Val Tyr Ser Val 
50 935 940 945 

Thr He Asp Gly Val Arg Asp Lys Val Gly Asn Thr He Ser Lys Tyr 
950 955 960 

S5 He Thr Ser Phe Lys Thr Val Ser Ala Asn Pro Thr Leu Ser Ser He 
965 970 975 

Ser He Ala Asp Gly Ala Val Asn Val Asp Arg Ser Lys Thr He Thr 
980 985 990 

60 

He Glu Phe Ser Asp Ser Val Pro Asn Pro Thr He Thr Leu Lys Lys 
995 1000 1005 1010 

Ala Asp Gly Thr Ser Phe Thr Asn Tyr Thr Leu Val Asn Val Asn Asn 
65 1015 - 1020 1025 
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Glu Asn Lys Thr Tyr Lys He Val Phe His Lys Gly Val Thr Leu Asr, 
ll^^O 1035 1040 

Glu Phe Thr Gin Tyr Glu Leu Ala Val Ser Lys Asp Phe Gin Thr Glv 
1045 1050 1055 

Thr Asp He Asp Ser Lys Val Thr Phe He Thr Gly Ser Val Ala Thr 
1060 10S5 1070 

Asp Glu Val Lys Pro Ala Leu Val Gly Val Gly Ser Trp Asn Gly Thr 

1080 1085 1090 

Ser Tyr Thr Gin Asp Ala Ala Ala Thr Arg Leu Arg Ser Val Ala Asp 
^055 1100 1105 

Phe Val Ala Glu Pro Val Ala Leu Gin Phe Ser Glu Gly He Asp Leu 
mo 1115 1120 

Thr Asn Ala Thr Val Thr Val Thr Asn He Thr Asp Asp Lys Thr Val 
1125 1130 1-^35 

^■^^ yf}«^-^® Asp Ala Asp His Asp Ala Gly Ala 

1140 1145 1150 

Thr Lys Glu Thr Leu Val He Asn Thr Val Thr Pro Leu Val Leu Asp 
^^5S 1160 . 1165 1170 

Asn Ser -Lys Thr Tyr Lys He Val Val Ser Gly Val Lys Asp Ala Ala 
1175 iiao 1185 

Gly Asn Val Ala Asp Thr He Thr Phe Tyr He Lvs 
1190 1195 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single strand 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 3: 
TTAATCGATT CTAGATGGAT AGGAAAAAAG CTG 3 3 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 



ATACCCGGGG GTACGGATCC GATACAGATT TGAGCAA 



37 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2766 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(i) INITIAL ORIGIN: 

(A) ORGANISM: Bacillus stearothermophilus 

(B) STRAIN: PV72 

(vii) IMMEDIATE ORIGIN: 

(B) CLONE (S) : sbsB 

C ix) CHARACTERISTIC : 

(A) NAME/KEY: CDS 

(B) POSTION: 1..2763 

( ix) CHARACTERISTIC : 

(A) NAME/KEY: sig_peptide 

(B) POSTION: 1..93 

( ix) CHARACTERISTIC : 

(A) NAME/KEY: mat^peptide 

(B) POSTION: 94.. 2763 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 5: 



ATG GCT TAT CAA CCT AAG TCT TTT CGC AAG TTT GTT GCG ACA ACT GCA 4 8 

Met Ala Tyr Gin Pro Lys Ser Phe Arg Lys Phe Val Ala Thr Thr Ala 

-31 -30 -25 -20 

ACA GCT GCC ATT GTA GCA TCT GCG GTA GCT CCT GTA GTA TCT GCA GCA 96 

Thr Ala Ala lie Val Ala Ser Ala Val Ala Pro Val Val Ser Ala Ala 
-15 -10 -5 1 

AGC TTC ACA GAT GTT GCG CCG CAA TAT AAA GAT GCG ATC GAT TTC TTA 14 4 

Ser Phe Thr Asp Val Ala Pro Gin Tyr Lys Asp Ala lie Asp Phe Leu 
5 10 15 

GTA TCA ACT GGT GCA ACA AAA GGT AAA ACA GAA ACA AAA TTC GGC GTT ' 192 

Val Ser Thr Gly Ala Thr Lys Gly Lys Thr Glu Thr Lys Phe Gly Val 
20 25 30 

TAG GAT GAA ATC ACT CGT CTA GAT GCG GCA GTT ATT CTT GCA AGA GTA 24 0 

Tyr Asp Glu He Thr Arg Leu Asp Ala Ala Val He Leu Ala Arg Val 

35 40 45 
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TTA AAA CTA GAC GTT GAC AAC GCA AAA GAG GCA GGC TTC ACA GAT GTG 2 SB 

Leu Lys Leu Asp Val Asp Asn Ala Lys Asp Ala Gly Phe Thr Asp Val 
SO 55 60 65 

5 CCA AAA GAC CGT GCA AAA TAC GTC AAC GCG CTT GTA GAA GCT GGC GTA 3 36 

Pro Lys Asp Arg Ala Lys Tyr Val Asn Ala Leu Val Glu Ala Gly Val 
70 75 80 

TTA AAC GGT AAA GCA OCT GGC AAA TTT GGT GCA TAC GAC CCA TTA ACT 3 84 

10 Leu Asn Gly Lys Ala Pro Gly Lys Phe Gly Ala Tyr Asp Pro Leu Thr 
85 90 95 

CGC GTT GAA ATG GCA AAA ATC ATC GCG AAC CGT TAC AAA TTA AAA GCT 432 
Arg Val Glu Met Ala Lys He He Ala Asn Arg Tyr Lys Leu Lys Ala 
15 100 105 110 

GAC GAT GTA AAA CTT CCA TTC ACT GAT GTA AAC GAT ACA TGG GCA CCA 480 
Asp Asp Val Lys Leu Pro Phe Thr Asp Val Asn Asp Thr Trp Ala Pro 
lis 120 125 

20 

TAC GTA AAA GCG CTT TAT AAA TAC GAA GTA ACC AAA AGG TTA AAA CAC 52 8 

Tyr Val Lys Ala Leu Tyr Lys Tyr Glu Val Thr "Lys Arg Leu Lys His 
130 135 140 145 

25 CAA CAA GCT TCG GTG CAT ACC AAA AAC ATC ACT CTG CGT GAC TTT GCG 576 
Gin Gin Ala Ser Val His Thr Lys Asn.- He Thr Leu Arg Asp Phe Ala 
150 155 160 

CAA TTT GTA TAT AGA GCG GTG AAT ATT AAT GCA GTG CCA GAA ATA GTT 624 
30 Gin Phe Val Tyr Arg Ala Val Asn He Asn Ala Val Pro Glu He Val 
165 170 175 

GAA GTA ACT GCG GTT AAT TCG ACT ACA GTG AAA GTA ACA TTC AAT ACG 67 2 

Glu Val Thr Ala Val Asn Ser Thr Thr Val Lys Val Thr Phe Asn Thr 
35 180 185 190 

CAA ATT. GCT GAT GTT GAT TTC ACA AAT TTT GCT ATC GAT AAC GGT TTA 72 0 

Gin He Ala Asp Val Asp Phe Thr Asn Phe Ala He Asp Asn Gly Leu 

195 200 205 

40 

ACT GTT ACT AAA GCA ACT CTT TCT CGT GAT AAA AAA TCC GTA GAG GTT 768 

Thr Val Thr Lys Ala Thr Leu Ser Arg Asp Lys Lys Ser Val Glu Val 
210 215 220 225 

45 GTG GTA AAT AAA CCG TTT ACT CGT AAT CAG GAA TAT ACA ATT ACA GCG 816 
Val Val Asn Lys Pro Phe Thr Arg Asn Gin Glu Tyr Thr He Thr Ala 
230 235 240 

ACA GGC ATT AAA AAT TTA AAA GGC GAG ACC GCT AAG GAA TTA ACT GGT 864 
50 Thr Gly He Lys Asn Leu Lys Gly Glu Thr Ala Lys Glu Leu Thr Gly 
245 250 255 

AAG TTT GTT TGG TCT GTT CAA GAT GCG GTA ACT GTT GCA CTA AAT AAT 912 
Lys Phe Val Trp Ser Val Gin Asp Ala Val Thr Val Ala Leu Asn Asn 
55 260 265 270 

AGT TCG CTT AAA GTT GGA GAG GAA TCT GGT TTA ACT GTA AAA GAT CAG 960 
Ser Ser Leu Lys Val Gly Glu Glu Ser Gly Leu Thr Val Lys Asp Gin 
275 280 285 

60 

GAT GGC AAA GAT GTT GTA GGT GCT AAA GTA GAA CTT ACT TCT TCT AAT 100 8 

Asp Gly Lys Asp Val Val Gly Ala Lys Val Glu Leu Thr Ser Ser Asn 

295 300 305 

6 5 ACT AAT ATT GTT GTA GTT TCA AGT GGC GAA GTA TCA GTA TCT GCT GCT 105 6 

Thr Asn He Val Val Val Ser Ser Gly Glu Val Ser Val Ser Ala Ala 
310 315 320 
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AAA GTT ACA GCT GTA AAA CCG GGA ACA GCT GAT GTT ACT GCA AAA GTT 

Lys Val Thr Ala Val Lys Pro Gly Thr Ala Asp Val Thr Ala Lys Val 

325 330 335 

5 ACA TTA CCA GAT GGT GTT GTA CTA ACA AAT ACA TTT AAA GTG ACA GTT 

Thr Leu Pro Asp Gly Val Val Leu Thr Asn Thr Phe Lys Val Thr Val 
340 345 3^0 

ACA GAA GTG CCT GTT CAA GTC CAA AAT CAA GGA TTT ACT TTA GTT GAT 

10 Thr Glu Val Pro Val Gin Val Gin Asn Gin Gly Phe Thr Leu Val Asp 
355 360 365 

AAT CTT TCT AAT GCT CCA CAG AAT ACA GTT GCA TTT AAC AAA GCT GAG 

Asn Leu Ser Asn Ala Pro Gin Asn Thr Val Ala Phe Asn Lye Ala Glu 

15 370 375 ■ 380 385 



20 



2 5 GCA ACT GTA CGT TCA TTA AAT CCA ATT ATT GCA ACA GCT GCT ATT AAT 

_z Ala Thr Val Arg Ser Leu Asn Pro Ilq. He Ala Thr Ala Ala He Asn 

X 425 " 430 

'^,4 GGT AGT GAG CTC CTT GTC ACA GCT AAT GCT GGC CAA TCT GGA AAA GCT 

30 Gly Ser Glu Leu Leu Val Thr Ala Asn Ala Gly Gin Ser Gly Lys Ala 

L 440 445 

m TCA TTT GAA GTA ACA TTA AAA GAT AAT ACA AAA AGA ACA TTT ACA GTT 
Ser Phe Glu Val Thr Leu Lys Asp Asn Thr Lys Arg Thr Phe Thr Val 



40 



TCT GTT AAA CTT TCC GAT GAA GCT GTT GGC GGC GGG GAA GTT GAA GGA 
Ser Val Lys Leu Ser Asp Glu Ala Val Gly Gly Gly Glu Val Glu Glv 
485 490 495 



1104 



1152 



1200 



1248 



AAA GTA ACT TCA ATG TTT GCT GGA GAA ACT AAA ACA GTT GCA ATG TAT 12 96 

Lys Val Thr Ser Met Phe Ala Gly Glu Thr Lys Thr Val Ala Met Tyr 
390 395 400 

GAT ACT AAA AAC GGT GAT CCT GAA ACT AAA CCT GTT GAT TTC AAA GAT 1344 
Asp Thr Lys Asn Gly Asp Pro Glu Thr Lys Pro Val Asp Phe Lys Asp 
405 410 415 



1392 



1440 



1488 



111 455 460 465 

;y GAT GTA AAA AAA GAC CCT GTA TTA CAA GAT ATA AAA GTA GAT GCA ACT 1536 

Asp Val Lys Lys Asp Pro Val Leu Gin Asp He Lys Val Asp Ala Thr 
470 475 480 



1584 



GTT AAC CAA AAA ACG ATT AAA GTA AGT GCA GTT GAC CAA TAC GGT AAA 163 2 

Val Asn Gin Lys Thr He Lys Val Ser Ala Val Asp Gin Tyr Glv Lvs 
500 505 510 

GAA ATT AAA TTT GGT ACA AAA GGT AAA GTT ACT GTT ACA ACT AAT ACA 
50 Glu He Lys Phe Gly Thr Lys Gly Lys Val Thr Val Thr Thr Asn Thr 
515 520 525 

GAA GGA CTA GTT ATT AAA AAT GTA AAT AGC GAT AAT ACA ATT GAC TTT 

^ ^^"^ ^^■^ ^-^^ Ser Asp Asn Thr He Asp Phe 

" ^-^^ 535 540 545 

GAT AGC GGC AAT AGT GCA ACT GAC CAA TTT GTT GTC GTT GCA ACA AAA 
Asp Ser Gly Asn Ser Ala Thr Asp Gin Phe Val Val Val Ala Thr Lys 
550 555 550 

GAC AAA ATT GTC AAT GGT AAA GTA GAA GTT AAA TAT TTC AAA AAT GCT 
Asp Lys He Val Asn Gly Lys Val Glu Val Lys Tyr Phe Lys Asn Ala 
565 570 575 

6 5 AGT GAC ACA ACA CCA ACT TCA ACT AAA ACA ATT ACT GTT AAT GTA GTA 
Ser Asp Thr Thr Pro Thr Ser Thr Lys Thr He Thr Val Asn Val Val 
580 585 590 



1680 



1728 



1776 



1824 



1872 
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AAT GTA AAA GCT GAC GCT ACA CCA GTA GGA TTA GAT ATT GTA GCA CCT 
Asn Val Lys Ala Asp Ala Thr Pro Val Gly Leu Asp He Val Ala Pro 
595 600 S05 



1920 



5 TCT AAA ATT GAT GTA AAT GCT CCA AAC ACT GCT TCT ACT GCA GAT GTT 196 8 

Ser Lys He Asp Val Asn Ala Pro Asn Thr Ala Ser Thr Ala Asp Val 

615 620 625 

GAT TTT ATA AAT TTC GAA AGT GTT GAG ATT TAG ACA CTC GAT TCA AAT 2016 
10 Asp Phe He Asn Phe Glu Ser Val Glu He Tyr Thr Leu Asp Ser Asn 

630 635 

GGT AGA CGT CAA AAA AAA GTT ACT CCA ACT GCA ACT ACA CTT GTA GGT 2 0 64 

Gly Arg Arg Gin Lys Lys Val Thr Pro Thr Ala Thr Thr Leu Val Glv 
IS 645 650 555 

ACA AAA AAA AAA AAA AAA GTT AAT GGG AAT GTA TTA CAA TTC AAG GGG 2112 
Thr Lys Lys Lys Lys Lys Val Asn Gly Asn Val Leu Gin Phe Lys Gly 
660 665 670 



20 



AAC GAA GAA TTA ACG CTA TCA ACT TCT TCT AGT ACA GGA AAC GTA GAT 2160 
Asn Glu Glu Leu Thr Leu Ser Thr Ser Ser Ser Thr Gly Asn Val Asp 
675 680 585 

25 GGA ACA. GCA GAA GGA ATG ACA AAA CGT ATT CCA GGG AAA TAT ATC AAC 2 ''0 8 

Gly Thr Ala Glu Gly Met Thr Lys. Arg He Pro Gly Lys Tyr He Asn 
690 695 700 705 

TCT GCA AGT GTA CCT GCC AGT GCA ACA GTA GCA ACA AGT CCT GTT ACT 22 5 6 

30 Ser Ala Ser Val Pro Ala Ser Ala Thr Val Ala Thr Ser Pro Val Thr 

710 715 720 

GTA AAG CTT AAT TCA AGT GAT AAT GAT TTA ACA TTT GAA GAA TTA ATA 2 3 04 

Val Lys Leu Asn Ser Ser Asp Asn Asp Leu Thr Phe Glu Glu Leu He 
3S 725 730 735 

TTC GGT GTA ATT GAC CCT ACA CAA TTA GTC AAA GAT GAA GAC ATC AAC 2 3 52 

Phe Gly Val He Asp Pro Thr Gin Leu Val Lys Asp Glu Asp He Asn 
740 745 750 



40 



GAA TTT ATT GCA GTT TCA AAA GCG GCT AAA AAT GAT GGA TAT TTG TAT 2 4 00 

Glu Phe He Ala Val Ser Lys Ala Ala Lys Asn Asp Gly Tyr Leu Tyr 
755 760 765 

4 5 AAT AAA CCG CTT GTA ACG GTT AAA GAT GCA TCA GGA AAA GTT ATT CCA 244 8 

Asn Lys Pro Leu Val Thr Val Lys Asp Ala Ser Gly Lys Val He Pro 

775 780 785 

ACA GGT GCA AAT GTT TAG GGT CTA AAT CAT GAT GCA ACT AAC GGA AAC 24 96 

50 Thr Gly Ala Asn Val Tyr Gly Leu Asn His Asp Ala Thr Asn Gly Asn 

790 795 800 

ATT TGG TTT GAT GAG GAA CAA GCT GGC TTA GCT AAA AAA TTT AGT GAT 2 54 4 

He Trp Phe Asp Glu Glu Gin Ala Gly Leu Ala Lys Lys Phe Ser Asp 
5S 805 BIO 815 

GTA CAT TTT GAT GTT GAT TTT TCA TTA ACT AAC GTT GTA AAA ACT GGT 2 592 

Val His Phe Asp Val Asp Phe Ser Leu Thr Asn Val Val Lys Thr Glv 
820 825 830 



60 



AGC GGT ACA GTT TCT TCA TCG CCA TCA TTA TCT GAC GCA ATT CAA CTT 264 0 

Ser Gly Thr Val Ser Ser Ser Pro Ser Leu Ser Asp Ala He Gin Leu 
835 840 845 

" ACT AAT TCA GGC GAT GCA GTA TCG TTT ACA TTA GTT ATC AAA TCA ATT 26 88 

Thr Asn Ser Gly Asp Ala Val Ser Phe Thr Leu Val He Lys Ser He 
. S55 860 855 
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TAT GTT AAA GGC GCA GAT AAA GAT GAT AAT AAC TTA CTT GCA GCC OCT 2736 

Tyr Val Lys Gly Ala Asp Lys Asp Asp Asn Asn Leu Leu Ala Ala Pro 
870 875 880 

5 GTT TCT GTC AAT GTG ACT GTG ACA AAA TAA P7fifi 
Val Ser Val Asn Val Thr Val Thr Lys 
885 890 



10 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

20 



Met Ala Tyr Gin Pro Lys Ser Phe Ard[ Lys Phe Val Ala Thr Thr Ala 
25 -31 -30 . -25 - -20 

Thr Ala Ala He Val Ala Ser Ala Val Ala Pro Val Val Ser Ala Ala 
-15 -10 -5 1 

30 Ser Phe Thr Asp Val Ala Pro Gin Tyr Lys Asp Ala lie Asp Phe Leu 
5 10 15 

Val Ser Thr Gly Ala Thr Lys Gly Lys Thr Glu Thr Lys Phe Gly Val 
20 25 30 

35 

Tyr Asp Glu He Thr Arg Leu Asp Ala Ala Val He Leu Ala Arg Val 
35 40 45 

Leu Lys Leu Asp Val Asp Asn Ala Lys Asp Ala Gly Phe Thr Asp Val 
50 55 60 65 

Pro Lys Asp Arg Ala Lys Tyr Val Asn Ala Leu Val Glu Ala Gly Val 
70 75 BO 

45 Leu Asn Gly Lys Ala Pro Gly Lys Phe Gly Ala Tyr Asp Pro Leu Thr 
85 90 95 

Arg Val Glu Met Ala Lys He He Ala Asn Arg Tyr Lys Leu Lys Ala 
100 105 l-^Q 

50 

Asp Asp Val Lys Leu Pro Phe Thr Asp Val Asn Asp Thr Trp Ala Pro 
115 120 125 

Tyr Val Lys Ala Leu Tyr Lys Tyr Glu Val Thr Lys Arg Leu Lys His 
" 1"^° 135 140 145 

Gin Gin Ala Ser Val His Thr Lys Asn He Thr Leu Arg Asp Phe Ala 
ISO 155 160 

60 Gin Phe Val Tyr Arg Ala Val Asn He Asn Ala Val Pro Glu He Val 
165 170 175 

Glu Val Thr Ala Val Asn Ser Thr thx Val Lys Val Thr Phe Asn Thr 
leo 185 190 
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Gin He Ala Asp Val Asp Phe Thr Asn Phe Ala He Asp Asn Gly Leu 
195 200 205 

Thr Val Thr Lys Ala Thr Leu Ser Arg Asp Lys Lys Ser Val Glu Val 
5 210 215 220 , 225 

Val Val Asn Lys Pro Phe Thr Arg Asn Gin Glu Tyr Thr He Thr Ala 
230 235 240 

10 Thr Gly He Lys Asn Leu Lys Gly Glu Thr Ala Lys Glu Leu Thr Glv 
245 250 255 

Lys Phe Val Trp Ser Val Gin Asp Ala Val Thr Val Ala Leu Asn Asn 
260 265 270 

IS 

Ser Ser Leu Lys Val Gly Glu Glu Ser Gly Leu Thr Val Lys Asp Gin 
275 280 285 

Asp Gly Lys Asp Val Val Gly Ala Lys Val Glu Leu Thr Ser Ser Asn 
20 290 295 300 305 

Thr Asn He Val Val Val Ser Ser Gly Glu Val Ser Val Ser Ala Ala 
310 315 320 

25 Lys Val Thr Ala Val Lys Pro Gly Thr Ala Asp Val Thr Ala Lys Val 
325 ,330 335 

Thr Leu Pro Asp Gly Val Val Leu Thr Asn Thr Phe Lys Val Thr Val 
340 345 350 

30 

Thr Glu Val Pro Val Gin Val Gin Asn Gin Gly Phe Thr Leu Val Asp 
355 360 3S5 

Asn Leu Ser Asn Ala Pro Gin Asn Thr Val Ala Phe Asn Lys Ala Glu 
35 370 375 380 385 

Lys Val Thr Ser Met Phe Ala Gly Glu Thr Lys Thr Val Ala Met Tyr 
390 395 400 

4 0 Asp Thr Lys Asn Gly Asp Pro Glu Thr Lys Pro Val Asp Phe Lys Asp 
405 410 415 

Ala Thr Val Arg Ser Leu Asn Pro He He Ala Thr Ala Ala He Asn 
420 425 430 

45 

Gly Ser Glu Leu Leu Val Thr Ala Asn Ala Gly Gin Ser Glv Lys Ala 
435 440 445 

Ser Phe Glu Val Thr Leu Lys Asp Asn Thr Lys Arg Thr Phe Thr Val 
^° -350 455 460 465 

Asp Val Lys Lys Asp Pro Val Leu Gin Asp He Lys Val Asp Ala Thr 
470 475 4B0 

55 Ser Val Lys Leu Ser Asp Glu Ala Val Gly Gly Gly Glu Val Glu Gly 
485 490 495 

Val Asn Gin Lys Thr He Lys Val Ser Ala Val Asp Gin Tyr Gly Lvs 
500 505 510 ^ ^ 

60 

Glu He Lys Phe Gly Thr Lys Gly Lys Val Thr Val Thr Thr Asn Thr 
515 520 525 

Glu Gly Leu Val He Lys Asn Val Asn Ser Asp Asn Thr He Asp Phe 
«5 530 . 



# 
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Asp Ser Gly Asn Ser Ala Thr Asp Gin Phe Val Val Val Ala Thr Lys 
550 555 560 

Asp Lys lie Val Asn Gly Lys Val Glu Val Lys Tyr Phe Lys Asn Ala 
5 565 570 575 

Ser Asp Thr Thr Pro Thr Ser Thr Lys Thr lie Thr Val Asn Val Val 
580 585 590 

10 Asn Val Lys Ala Asp Ala Thr Pro Val Gly Leu Asp lie Val Ala Pro 
595 600 605 

Ser Lys lie Asp Val Asn Ala Pro Asn Thr Ala Ser Thr Ala Asp Val 
610 615 620 625 

15 

Asp Phe lie Asn Phe Glu Ser Val Glu lie Tyr Thr Leu Asp Ser Asn 
630 635 640 

Gly Arg Arg Gin Lys Lys Val Thr Pro Thr Ala Thr Thr Leu Val Gly 
20 645 650 655 

Thr Lys Lys Lys Lys Lys Val Asn Gly Asn Val Leu Gin Phe Lys Gly 
660 665 670 

2S Asn Glu Glu. Leu Thr Leu Ser Thr Ser Ser Ser Thr Gly Asn Val Asp 
675 680 . 6B5 

Gly Thr Ala Glu Gly Met Thr Lys Arg He Pro Gly Lys Tyr He Asn 
690 695 700 705 

30 

Ser Ala Ser Val Pro Ala Ser Ala Thr Val Ala Thr Ser Pro Val Thr 
710 715 720 

Val Lys Leu Asn Ser Ser Asp Asn Asp Leu Thr Phe Glu Glu Leu He 
35 725 730 735 

Phe Gly Val He Asp Pro Thr Gin Leu Val Lys Asp Glu Asp He Asn 
740 745 750 

40 Glu Phe He Ala Val Ser Lys Ala Ala Lys Asn Asp Gly Tyr Leu Tvr 
755 760 765 

Asn Lys Pro Leu Val Thr Val Lys Asp Ala Ser Gly Lys Val He Pro 
770 775 780 7B5 

45 

Thr Gly Ala Asn Val Tyr Gly Leu Asn His Asp Ala Thr Asn Gly Asn 
790 795 800 

He Trp Phe Asp Glu Glu Gin Ala Gly Leu Ala Lys Lys Phe Ser Asp 
50 805 810 815 

Val His Phe Asp Val Asp Phe Ser Leu Thr Asn Val Val Lys Thr Gly 
820 825 830 

55 Ser Gly Thr Val Ser Ser Ser Pro Ser Leu Ser Asp Ala He Gin Leu 
835 840 B45 

Thr Asn Ser Gly Asp Ala Val Ser Phe Thr Leu Val He Lys Ser He 
850 855 860 865 

60 

Tyr Val Lys Gly Ala Asp Lys Asp Asp Asn Asn Leu Leu Ala Ala Pro 
870 B75 880 

Val Ser Val Asn Val Thr Val Thr Lys 
6S 885 p^90 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 





CCCATGGACC 


CGTCCAAGGA 


CTCCAAAGCT 


CAGGTTTCTG 


CAGCCGAAGC 


TGGTATCACT 


GO 




GGCACCTGGT 


ATAACCAACT 


GGGGTCGACT 


TTCATTGTGA 


CCGCTGGTGC 


GGACGGAGCT 


120 




CTGACTGGCA 


CCTACGAATC 


TGCGGTTGGT 


AACGCAGAAT 


CCCGCTACGT 


ACTGACTGGC 


180 


. ^ 


CGTTATGACT 


CTGCACCTGC 


CACCGATGGC 


TCTGGTACCG 


CTCTGGGCTG 


GACTGTGGCT 


240 




TGGAAAAACA 


ACTATCGTAA 


TGCGCACAGC 


GCCACTACGT 


GGTCTGGCCA 


ATACGTTGGC 


300 




GGTGCTGAGG 


CTCGTATCAA 


CACTCAGTGG 


CTGTTAACAT 


CCGGCACTAC 


CGAAGCGAAT 


360 




GCATGGAAAT 


CGACACTAGT 


AGGTCATGAC 


ACCTTTACCA 


AAGTTAAGCC 


TTCTGCTGCT 


420 




AGCATTGATG 


CTGCCAAGAA 


AGCAGGCGTA 


AACAACGGTA 


ACCCTCTAGA 


CGCTGTTCAG 


480 




CAATAATAAG 


GATCCGGG 










498 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
TTCATCGT/-A ACGCCGAATT TTGTTTCTG 29 



INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGGGAAATAT ATCAACTCTG CAAGTG 26 



